## Using rownames(importance_df) as id variables

Can we predict the endpoint C. difficile CFU with the community structure at Day 0?

Endpoint has data from Day 2,3,10 - might be misleading since contains CFU from different data points

Does Day Community structure predict C. difficile CFU of that day?

RF all mice/days together RF cfu vs community by day

## Using rownames(importance_df_day) as id variables

Does removing the early euthanized samples improve prediction in days 1 to 3?

## Using rownames(importance_persist) as id variables

## Using rownames(importance_persist_day) as id variables

Eliminating mice euthanized early from the RF model gives similar R^2 and MSE, bu there is a slight advantage to community OTU features of Day 0 to predict Day1 CFU. As well as the R^2 value increases with increasing features, whereas when all mice are used the day 1 R^2 only decreases with increasing features. Of note, accompaied by this is an increase in the % MSE attributed to OTU15 (Akkermansia), which has seemed to stand out in all other days/analysis.Interestingly, akkermansia does not appear to have the same relationship when compared to the same day cfu and community. This could suggest akkermansia is promoting the intial colonization of c difficile. At first glance, OTU135 (Coriobacteriaceae) seems to stand out for predicting cfu of the same day as well as from day 0.


Can we predict moribundity with the community structure at Day 0?